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A  GENERALIZED  QUANTILE  ESTIMATOR  UNDER  CENSORING 


Y.  L.  Lio  and  V.  J.  Padgett 


Department  of  Statistics 
University  of  South  Carolina 
Columbia,  SC  29208 


ABSTRACT 

Based  on  right-censored  data  from  a  lifetime  distribution  Fq, 
a  smooth  alternative  to  the  product-limit  estimator  as  a 
nonparametric  quantile  estimator  of  a  population  quantile  is 
proposed.  The  estimator  is  a  "generalized  product-limit  quantile" 
obtained  by  averaging  appropriate  subsample  product-limit 
quantiles  over  all  subsamples  of  a  fixed  size.  Under  the  random 
censorship  model  and  some  conditions  on  Fq,  it  is  shown  that  the 
estimator  is  consistent  and  has  the  same  asymptotic  normal 
distribution  as  the  product-limit  quantile  estimator.  A  small 
Monte  Carlo  simulation  study  shows  that  there  exist  some  values  of 
the  subsample  size  for  which  the  estimator  performs  better  than 
the  product-limit  quantile  estimator  in  the  sense  of  estimated 
mean  squared  errors. 


1  ■  INTRODUCTION 

Arbitrarily  right-censored  data  arise  naturally  in  industrial 
life  testing  and  medical  follow-up  studies.  In  these  situations  it 
is  important  to  be  able  to  obtain  nonparametric  estimates  of 
various  characteristics  of  the  survival  function  S.  One 


characteristic  of  the  survival  distribution  that  is  of  interest  is 
the  quantile  function.  For  any  probability  distribution  function 
G,  the  quantile  function  is  defined  by  Q(p)»G~^(p)=  - 

inf{x:  G(x)  >  p} ,  0  <  p  <  1. 

For  a  random  (uncensored)  sample  X.,...,X  from  G,  the  sample 
_1  in 

quantile  function  Gr  (p)=inf{  x:  Gr(x)  >  p} ,  0<  p  <1,  has  been 
used  to  estimate  where  Gr(x)  denotes  the  empirical 

distribution  function.  Csorgo  (1983)  gave  many  of  the  known 
results  concerning  G”*(p).  Kaigh  and  Lachenbruch  (1982) 
considered  a  "generalized  sample  quantile"  obtained  by  averaging 
an  appropriate  subsample  quantile  over  all  subsamples  of  a  given 
size. 

For  arbitrarily  right-censored  data,  Sander  (1975)  proposed 
estimation  of  ^  by  the  quantile  function  of  the  product-limit 
(PL)  estimator  of  S.  She  and  Cheng  (1984)  obtained  some 
asymptotic  properties  of  that  estimator.  Reid  (1981)  studied 
Influence  functions  for  any  Frechet-differentiable  function  of  the 
PL  estimator,  gave  the  influence  function  of  the  PL  quantile 
function,  and  obtained  the  same  asymptotic  normality  for  the  PL 
quantile  function  as  Cheng  (1984)  did.  Padgett  (1986)  proposed  a 
kernel-type  estimator  which  smoothed  the  PL  quantile  function. 

The  intent  of  this  paper  is  to  propose  and  study  a 
generalized  PL  quantile  estimator  based  on  right-censored  data. 
This  generalized  PL  quantile  estimator  is  obtained  by  averaging 
subsample  PL  quantile  estimates  over  all  subsamples  with  subsample 
size  k  from  a  right-censored  sample  of  size  n,  where  1<  k<  n. 

Vhen  censoring  is  not  present,  this  generalized  PL  quantile 
estimator  becomes  the  generalized  quantile  estimator  proposed  by 
Kaigh  and  Lachenbruch  (1982). 

It  will  be  shown  that  under  some  nonrestrictive  conditions 
the  generalized  PL  quantile  estimator  satisfies  the  properties  of 
U-statistics,  and  the  asymptotic  normality  for  the  estimator  is 
presented  in  Section  3.  It  should  be  mentioned  that  the  order 
statistic  methods  used  by  Kaigh  and  Lachenbruch  (1982)  to  obtain 
an  expression  for  the  asymptotic  variance  of  the  generalized 
sample  quantile  function  for  uncensored  data  cannot  be  used  in  the 


case  of  right-censored  observations.  This  is  due  to  the  unequal 
random  jumps  in  the  PL  distribution  function.  Based  on  the 
results  of  a  small  Monte  Carlo  simulation  study  reported  in 
Section  4,  in  many  cases  there  is  a  subsample  size  k  for  which  the 
generalized  PL  quantile  estimator  performs  better  than  the  PL 
quantile  estimator  in  the  sense  of  smaller  estimated  mean  squared 
errors. 

For  the  generalized  PL  quantile  estimator,  one  problem  is  the 
optimal  choice  (in  some  sense)  of  the  subsample  size  k.  Since  no 
results  on  the  exact  mean  squared  error  of  the  proposed  estimator 
are  currently  available,  the  subsample  size  that  minimizes  the 
mean  squared  error  cannot  be  obtained.  Bootstrap  methods  for 
randomly  right-censored  data  (Efron  1981)  might  be  used  in  some 
cases,  however,  to  estimate  the  optimal  subsample  size  from  the 
data  but  would  require  extensive  amounts  of  computer  time.  This 
procedure  is  still  under  study. 

2.  GENERALIZED  PL  QUANTILE  ESTIMATOR 

Let  x!j\...,X®  be  the  true  survival  times  of  n  items  or 
individuals  that  are  censored  on  the  right  by  a  sequence 
U.,...,U_,  which  are  independent  random  variables  with  identical 

In  q 

distribution  H  (usually  unknown).  It  is  assumed  that  the  Xj's  are 
nonnegative  independent  identically  distributed  random  variables 
with  common  unknown  distribution  function  Fq  and  unknown  quantile 
function  Q^(p)»^«inf (t:  Fq(0>p},  0<p<l. 

The  observed  right-censored  data  are  denoted  by  the  pairs 
(Xj,  Aj),  i-l,...,n,  where 

XA  -min{xj,  l^},  ^-1  if  X°  <  Uj 

-0  if  X°  >  Ur 

Thus,  it  is  known  which  observations  are  times  of  failure  or  death 
and  which  ones  are  censored  or  loss  times.  For  this  model  the 
distribution  function  of  X^  is  F*1-(1-Fq)(1-H) . 

Based  on  the  censored  sample  (X^,  A^ ) ,  i»l,2,...,n,  a  popular 
estimator  of  the  survival  function  l-Fg(t)  at  t>0  is  the  PL 
estimator,  proposed  by  Kaplan  and  Meier  (1958)  as  the 
"nonparametric  maximum  likelihood  estimator."  Efron  (1967)  showed 


4 


that  this  estimator  is  "self-consistent."  Let  (Z^,A^  ), 
i-l,...,n,  denote  the  ordered  X^'s  along  with  their  corresponding 
A^'s.  Then  the  PL  estimator  of  l-Fg(t) 
is  defined  by 


pn<t> 


1, 


k-1 

n 

l-i 


(• 


n  -  i 
n  -  i  +1 


0  <  t  <  zr 

Zk-1  <  1  -  Zk’ 
k-  2 , . . . ,  n 


-  0,  Z  <  t. 

n 

Denote  the  PL  estimator  of  Fg(t)  by  Fr  -  1-PR. 

Based  on  randomly  right-censored  data,  it  is  natural  to 

estimate  the  quantile  function  Q^(p)  by  the  PL  quantile  function 

Qn(X1,A1,...,Xn,dnJp)^n(p)-inf{t:  FR(t)>p}.  Cheng  (1984) 

obtained  asymptotic  normality  results  for  Q  .  Reid  (1981)  got  the 

influence  curve  for  Q  . 

n  0 

A  generalized  PL  quantile  estimator  of  Q  (p),  0<p<l,  based  on 
the  randomly  right-censored  observations  (X^,Aj),  1-1,..., n  is 
defined  as  follows: 

For  a  fixed  integer  k,  1  <  k  <  n,  consider  the  selection  of  a 
simple  random  sample  (X^^, A^^),!-!, . . . ,k  (without  replacement) 
from  the  right-censored  data  (X^,Aj),  i«l,...,n.  By  using  this 
subsample  i-l,...,k,  the  PL  pth  quantile  estimator 

Qk*Xkl' ^kl’  *  ‘  ’  ,Xkk’  Akk;p^k^p^’  0  S  P  <  1»  ls  obtained  as  defined 

above.  Then  the  generalized  PL  quantile  K  .  ,  0  <  p  <  1,  is 

P  t  n 

defined  to  be  the  average  of  the  subsample  PL  pth  quantile 
estimators  over  all  (£)  subsamples  of  size  k  from  (X^.A.), 
i«l,...,n.  Therefore 


Cp;k;n  *  (n.  Z  Qk(p? 1 

V  Cn 


(2.1) 


where  C  indicates  that  the  summation  is  over  all  combinations 
n 

{kl,...,kk}  of  k  integers  selected  from  (l,...,n).  Subject  to  the 

obvious  constraint  14t<n,  the  assumed  subsample  size  is  arbitrary 

and  the  choice  k-n  in  (2.1)  gives  K  .  «Q  .  Thus,  the  statistics 

p  |  ic  f  n  n 

defined  by  (2.1)  form  a  collection  of  "generalized  quantile 
estimators"  which  includes  the  usual  PL  quantile  estimator.  This 


estimator  (2.1)  is  obviously  a  U-statistic  (Hoeffding,  1984)  with 
kernel  which  is  symmetric  with  respect  to  (X^,  A^), . . .  ^X^.  A^). 

A  PL  quantile  is  not  in  general  an  unbiased  estimator  of 

the  corresponding  quantile  of  the  lifetime  distribution,  although 

Lio  and  Padgett  (1986)  have  shown  that  any  bias  becomes  negligible 

with  increasing  sample  size  k.  Appealing  to  a  monotonicity 

principle  would  suggest  that  the  subsampling  scheme  provides  an 

estimator  K  .  of  E  with  bias  magnitude  exceeding  that  of  the 
P » n  p» 

PL  quantile  estimator  Qn(X^,  . . . ,XR, A^jp) .  However,  it  would 
seem  plausible  also  that  the  averaging  procedure  might  result  in  a 
reduction  of  sampling  variability  which  is  adequate  to  decrease 
the  mean  squared  error  of  estimation.  This  propefty  will  be 
indicated  in  the  Monte  Carlo  simulations  reported  in  Section  4. 

Due  to  the  censoring,  K  .  is  not  a  simple  linear 

P  ?  n 

combination  of  order  statistics  of  lifetime  data.  Vhen  A.»l, 
i«l,..,n  (i.e.  no  observation  is  censored),  the  generalized  PL 
quantile  estimator  reduces  to  the  estimator  proposed  by  Kaigh  and 
Lachenbruch  (1982)  which  is  a  linear  combination  of  order 
statistics  of  the  random  sample.  They  then  used  order  statistic 
properties  to  get  the  asymptotic  variance  and  mean  squared  error 
for  their  generalized  quantile  estimator.  In  the  case  of  random 
right-censoring,  similar  results  for  the  variance,  bias,  and  mean 
squared  error  of  the  generalized  PL  quantile  estimator  and  the 
mean  squared  consistency  of  the  generalized  PL  quantile  estimator 
seem  to  be  difficult  to  obtain  under  general  conditions  on  Fq  and 
H.  Some  asymptotic  results,  however,  have  been  obtained  under 
reasonable  conditions  and  are  discussed  in  the  next  section. 


3.  SOME  ASYMPTOTIC  RESULTS 


In  this  section,  asymptotic  normality  for  the  generalized  PL 
quantile  estimator  is  presented.  The  somewhat  lengthy  proof  of 
Theorem  3.2  will  be  given  in  the  Appendix. 

For  a  distribution  function  G,  let  TG*sup{t:  G(t)<l}. 


bounded  first  derivative  on  a  neighborhood  of  p.  If  for  fixed 

l<k<n,  E((Qk(p))2)<  -,  then  n1/2(Kp;k.n  -  up;n)  - ->  Z  in 

distribution  as  n  where  -  E(K  ,  )  and  Z  is  a  normal 

p;n  p;k;n  » 

random  variable  vith  mean  zero  and  variance  a  .  « 

•  p  j  K 

k2var(E(Qk(p)|X1)). 

The  proof  of  Theorem  3.1  follows  from  U-statistic  properties 
of  Hoeffding  (1948). 

Due  to  censoring,  a  simple  expression  for  the  asymptotic 

variance  of  this  generalized  PL  quantile  estimator  K  .  is 

difficult  to  get,  and  the  bias  term  is  also  unknown.  Therefore, 

theoretical  comparison  of  this  generalized  PL  quantile  and  PL 

quantile  estimators  in  terms  of  asymptotic  variance  or  in  terms  of 

mean  squared  error  is  still  not  available.  However,  simulation 

results  presented  in  the  Section  4  indicate  a  range  of  possible 

values  of  subsample  size  k  for  which  the  mean  squared  errors  of 

K  .  are  less  than  those  of  the  PL  quantile  estimator  for  each 
p;K;n 

P- 

By  U-statistic  properties  (Hoeffding,  1948),  it  is  easy  to 

2 

show  for  fixed  l<k<n,  n  var(K  .  )  decreases  to  lower  limit  a  .  . 

P»k;n  p;x 

In  fact,  an  immediate  application  of  Theorems  5.1  and  5.2  of 
Hoeffding  (1948),  gives  the  following  corollary. 

Corollary  3.1.  For  0<  p  <  min  {  1,  and  fixed  14t<n, 

we  have  ap;k  <  n  var(Kp;k.n)  <  k  varQk(p). 


In  Theorem  3.1  and  Corollary  3.1  the  existence  of  second 

moments  of  the  PL  quantile  estimator  was  assumed.  Lio  and  Padgett 

*  2 

(1986)  has  proven  that  for  sufficiently  large  k,  E [ ( Qk ( p ) )  J  is 
finite  and  the  bias  becomes  negligible.  Therefore,  we  develop  a 
companion  result  to  Theorem  3.1  which  provides  the  asymptotic 
normality  of  the  generalized  PL  quantile  estimator  as  the 
subsample  size  also  increases  to  infinity. 

Theorem  3.2.  Let  p  be  such  that  0<p<  min{l,  TtI/„0.).  Suppose  H 

0  ’ 

and  Fq  are  continuous  and  Q  is  differentiable  in  a  neighborhood 


of  p  vith  bounded  first  derivative  on  a  neighborhood  of  p.  If 
-*  •  and  lim  inf(kn/n)  »  c  (  >  0)  as  n  ■>  ®,  where  c  is  some 
constant,  then 

n*^(K  .  -  Q^(p))  ■»  Z  in  distribution, 

P • Kn ' n 

where  Z  is  a  normal  random  variable  with  mean  zero  and  variance 
2  ,,  v2  ,p  dx 

«  -  (Q  (P))  (1-P)  Jq  - 2 - 0 -  • 

p  U  (l-x)Z(l-H(QU(x))) 

Note  that  the  limiting  distribution  in  Theorem  3.2  is  the 
same  as  that  for  Qn(p)  obtained  by  Cheng  (1984). 

4.  SOME  SIMULATION  RESULTS 

Often  only  small  samples  are  available  in  real  situations  due 

to  the  expense  or  difficulties  to  obtain  lifetime  data.  Hence, 

some  investigation  of  the  small  sample  properties  of  K  .  is 

P  !  *t  »  n 

needed.  Therefore,  a  small  Monte  Carlo  simulation  study  was 
performed  for  two  common  families  of  lifetime  distributions. 

These  distributions  are  the  exponential  distribution  with  density 
f(x)*»  3  exp(-(3x),  x>0,  (3-1,  and  the  Veibull  distribution  with 
density 

(+)•)•  ->0- 

(«x,3)-(0.5,l),  (2,1),  (2,5).  Two  censoring  distributions  H  were 

used:  exponential  with  density  h(u)«Xe~^u,  u>0,  X>0,  and  uniform 
on  the  interval  (0,  A),  A>0. 

The  parameter  X  of  the  censoring  distribution  was  determined 
to  give  either  30X  or  50X  censoring.  That  is,  X  was  determined  so 
that  the  probability  of  a  censored  observation,  pr(X^  >  U  )-0.3  or 
0.5,  at  least  approximately.  This  probability  was  calculated  by 
numerical  integration  using  the  midpoint  rule  when  it  could  not  be 
obtained  exactly.  The  value  of  X  is  reported  in  the  resulting 
table  for  each  case. 

Since  the  generalized  PL  quantile  estimator  is  not  a  linear 
combination  of  order  statistics  of  lifetime  data  and  it  can  not  be 
simplified,  the  simulation  procedure  was  the  following:  For  given 
sample  size  n,  let  l<k<n. 


I  I'Lt'i  i 


'w*  rvrrs^r^rvtrwnj* 


Step  1.  A  random  sample  of  size  n,  X^,...,X^,  vas  generated  from 
the  lifetime  distribution. 

Step  2.  A  random  sample  U^,...,Un  vas  generated  from  the 
censoring  distribution. 

Step  3.  The  censored  sample  (X^, A^) , . . . , (XR, An)  vas  obtained  by 


.minCxV.Uj) 


if  X.  -  Xj 

if  Xj  -  u.,- 


Step  4.  Take  a  random  sample  of  pairs  (X^,^),  i«l,...,k, 
(vithout  replacement)  from  (X^,A^),  i»l,...,n. 


Step  5. 


Order  the  X^'s,  k,  from  the  smallest  to  the 


Step  7. 


largest. 

Step  6.  Compute  the  PL  estimator  of  lifetime  distribution  based 
on  the  chosen  subsample  of  size  k. 

Step  7.  For  each  value  p-0.10,  0.25,  0.50,  0.75,  0.90,  0.95, 
compute  the  PL  quantile  estimator  based  on  the  chosen 
subsample. 

The  above  procedure  (from  step  4  to  step  7)  vas  repeated  for 
all  (£)  different  subsamples. 

Step  8.  Average  all  (P)  subsample  results  to  get  K  .  . 

k  p I K| n 

Repeat  step  1  through  step  8  for  N  samples. 

Step  9.  The  mean  squared  error,  sample  variance,  and  bias  of 

K  .  vas  computed  over  all  N  samples. 

P  t  k#  n 

Step  10.  For  each  l<k<n  and  each  p,  the  ratios  of  mean  squared 

error  for  PL  quantile  estimator  and  mean  squared  error 

for  K  , _ vere  calculated. 

p ,  k,  n 

For  given  sample  size  n,  this  simulation  procedure  needs 
2n  -  1  loops  for  each  sample,  therefore  a  large  amount  of  cpu  time 
vas  required  for  each  simulation.  For  example:  When  N=200,  n=20, 
the  required  cpu  time  is  over  10  days  on  a  DEC  VAX  11-750 
computer.  When  n=30,  the  cpu  time  is  over  30  days  on  a  DEC  VAX 
11-750  computer  and  hence  is  impractical.  It  seems  that  cpu  time 
increases  vith  exponential  rate  vhen  sample  size  n  increases. 
Therefore,  sample  sizes  of  n=10,  20  vere  chosen  in  the  simulation 
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study,  and  for  each  case  simulated  (i.e.  each  distribution,  p,  and 
sample  size  combination),  200  censored  samples  were  generated 
using  the  random  number  generator  in  the  International 
Mathematical  and  Statistical  Library  (IMSL,  1982)  on  an  FPS  264 
attached  processor  to  an  IBM  3081  computer.  For  each  simulation 
case  using  ns20,  N=200,  the  cpu  time  vas  24.75  seconds  on  that 
system. 

Some  of  the  results  of  the  simulations  are  shown  in  Tables 

4.1-4.12  which  contain  the  ratios  of  estimated  mean  squared  error 

for  PL  quantile  estimator  and  estimated  mean  squared  error  for  the 

generalized  PL  quantile  estimator,  MSE(Q  (p))/MSE(K  .  ). 

n  p  f  it  j  n 

In  each  case,  for  p<0.90,  there  is  a  k  value  for  which  K  , 

P»  *■»  n 

has  smaller  estimated  mean  squared  error  than  that  of  the  PL 

quantile  estimator.  In  particular,  this  is  true  for  several  k 

values  for  the  median  estimators  Kq  g  .  .  For  small  p  and  large 

p,  those  k  values  such  that  K  .  has  smaller  mean  squared  error 

p  y  K)  n 

than  the  PL  quantile  estimator  are  close  to  the  sample  size  n.  In 
all  cases,  the  subsample  size  k  giving  the  largest  ratio  of 
estimated  mean  squared  error  tends  to  decrease  with  p  up  to  about 
0.5,  and  then  increase  for  larger  p.  The  parameter  k  determines 
the  amount  of  smoothing  of  this  estimator,  small  k  indicating  more 
smoothing.  Therefore,  more  smoothing  is  needed  in  the  middle  of 
the  distribution  than  in  the  tails  to  decrease  the  mean  squared 
error  of  the  generalized  PL  quantile  estimator. 

Increasing  the  amount  of  censoring  from  30£  to  502  seems  to 
have  little  effect  on  the  estimated  ratio  of  mean  squared  errors, 
especially  for  large  values  of  p.  Also,  the  behavior  of  the 
estimator  is  similar  for  the  two  censoring  distributions  used  in 
the  simulations. 

5.  AN  EXAMPLE 

As  an  example  of  the  generalized  PL  quantile  estimator,  15 
observations  were  chosen  from  the  lung  cancer  data  (standard, 
squamous)  in  Data  Set  I  of  Kalbfleisch  and  Prentice  (1980, 
p.  223).  The  data,  transformed  to  "months,"  are  shown  in  Table 
5.1,  where  "+"  denotes  a  censored  observation.  So,  A^»Ag«0  and 
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s 


all  other  A.  's  are  one.  The  quantile  estimators  K  ,  computed 
1  p|Kjn 

from  this  data  with  k-8,10,13  are  shown  in  Figures  5. 1-5.3  along 
with  the  PL  quantile,  shoving  the  smoothing  that  has  been 
obtained. 


TABLE  5.1  EXAMPLE  DATA 


i 

1 

2 

3 

4 

5 

6 

7  8 

Zi 

0.27 

0.33 

0.37 

0.83+ 

1.40 

2.40 

2.73  3.33* 

i 

9 

10 

11 

12 

13 

14 

15 

Zi 

3.67 

3.93 

4.20 

4.80 

7.60 

10.47 

13.70 

APPENDIX 


In  order  to  prove  Theorem  3.2,  the  representation  of  Peterson 


(1977)  for  the  Kaplan-Meier  estimate  Fn(t)  of  the  lifetime 


distribution  is  used.  The  PL  p-th  quantile  estimate  0^(p)  based 
on  sample  (X^,A^>,  i*l,...,k,  is  considered  as  a  function 
V<sJ,sJ,p)  of  two  empirical  subsurvival  functions  S^,  s£,  vhere 

l-s£<t)«  4“  1  HVt,  V11' 


i*l 


l-s£< t)-  4”  E  I[Xi<t,  ij.0]. 
i*l 


The  corresponding  p-th  quantile  is  the  value  of  V  at  (SU,SC) 
where  Su(t)-PlXi>t,  ^-lj  and  SC(t)-P[Xi>t,  ^-0].  Let  (3  be  the 
space  of  subsurvival  functions,  i.e.  the  space  of  decreasing  left 
continuous  functions  from  R  into  [0,a],  vhere  a<l.  Let  I  lwi-Si  I  in 
sup  |  W^(x)-S^(x)  |  vhere  and  Sj  belong  to  3-  Let  3^  be  the 


0<x<« 


product  space  of  3  and  itself,  and  define  a  norm  on  3^  such  that 
for  any  V2  e  32  |  ^  -V2||2  - 1  l^1X-W21 1  li*  I  IW12“W22  1 1^  vhere 


Wjj  is  the  jth  component  of  W^,  j-1,2,  i-1,2.  Clearly,  |  -V2 1 1 


^HwirW2ill-+IIWi2“W22ll-*  Reid  (1981)  proVed  that  V  is 

Fr4chet-differentiable  at  Su  and  Sc  with  respect  to  ||*  ||—  in  each 

argument  Su  and  Sc  for  0<p<  min{l,  T^qOj},  and  has 
continuous  partial  derivatives.  Therefore,  V  is  Frechet- 
dif ferentiable  at  (SU,SC  )  with  repect  to  ||*||  for 


0<  p  <  min(l,  Th^qOj},  and 
V(Sk’  Sk’  p)  -v<sU»sC*P> 

■  nr  ^  ic1(v,su,scii!i)(p) 

-  -ir  *  ic2(v.s".sei*1)<p)  Its..oi 

1*1  1 

-  J  IC1(V,SU,Sc;s)(p)dSU(s)-  J  IC2(V,SU,Sc;s)(p)dSc(s) 

+  o< |  |Sk-S  1 1),  (A.l) 

where  Sk=(sjJ,s£)  and  S»(SU,SC).  Now, 

~~  l  %  (P)  -  V(SU,Sc,p) 

<  £)  " 

■  n 

- JT"  t  l  V(S“  ,  s£  ,p)-V(SU,SC,p)] 

(  u  )  n  n 

ri 

I  IC1(V,Su,Sc;X1)(p)  If. 

(J)  C  n  i«l  1  1  l*i  1J 

n 

"  X  ifl  IC2(V’sU’sCjXi)(p>}  I|Si-0J> 

+  ( -jr  1  Hsk  -s  II  > 0(1) 

)  n 

*n 

■u„  ♦<-£-,  >IM  ll)o(i), 

(  k  )  n 

n 

where  ICj  and  IC2  are  influence  curves  of  the  PL  quantile  function 
(see  Reid,  1981). 

Ve  need  the  following  lemma  to  complete  the  proof. 

1/2 

Lemma  1.  n  U  - ■>  Z  in  distribution,  where  Z  is  a  normal 

n  2 

random  variable  with  mean  zero  and  variance  a  . 

P 

Proof.  U  is  a  U-statistic  with 
-  n 

h<Xr...,Xk^)-  — ~  j^IC2  I[5i-0] 

and  E(h)«0  (see  example  2  and  p.83  of  Reid,  1981). 
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I 


j ;  >,t  «.|  i  t  i>|  ■ 


'  «  *'4  «**  .u  .  it 


err 


i 


£ 

It  is  easy  to  shov  that  Var(U  )■  -  1°  (kn)  (P  kn)C , 

"  (  C  >  c-1  n~c 

n 

vhere  for  l<c<  kR 

Cc»E[h(X^, . . .  •  jX^  )*h(X^, . . .  ,Xc,Xjt  +l,**,^2k  -c^ 

I  /  icj  dFu(s)  +  J  IC2  dFc(s)  ] 
kn 

,  -£-  C2 

k2  V 

n 

Therefore,  Var(Un)  «-jj—  c2. 

Let  <  “  TT  <  hi<xi>  > 

1*1 

"n”  lih1Cl  II5i“1l+ifiIC2  II*i-°l} 

vhere  h-^x)  -  E[h(x,X2, . . . ,Xk  )].  Since  E[h2]  <  -  (Reid,  1981), 

using  the  same  argument  as  the  proof  of  Theorem  3.3.13  of  Randles 
and  Wolfe  (1979),  it  is  easy  to  get 

n  El<VVn)2]-nEl(Un)2l+  nElVn  2]  ~  2nEIUn  Vn  1 

-  n  Var(Un)  +  c2  -  2  k2  ^  -  0. 

1/7  ★ 

Reid  (1981)  has  proven  that  n  VR  - >  Z  in  distribution. 

1  /O 

Therefore  n  U  - >  Z  in  distribution.  // 

n 

Since  ||  Sk  -  S||  <  |  |S«  -  SC  ||.+  ||  sj  -SU  ||„ 
n  n  n 

and  kJ/2||S^-  Sc||.«  0p(l),  k*/2  |  |S^-  Su||„-  0p(l),  by 

using  a  basic  argument  the  folloving  Lemma  is  easy  to  prove. 

kj/2 

Lemma  2.  ~ —  [  Z  IK  -S\\)  -  0J1). 

(n  >  n  P 

n 

With  Lemma  1  and  Lemma  2  and  (A.l),  letting  kR  — >•  and 
lim  inf(kn/n)  «c  (>0)  for  some  constant  c  as  n — >  ®, 

Theorem  3.2  is  proven.  Ill 


5!W 


0  >s 
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